智能论文笔记

When Does Re-initialization Work?

Sheheryar Zaidi , Tudor Berariu , Hyunjik Kim , Jörg Bornschein , Claudia Clopath , Yee Whye Teh , Razvan Pascanu

分类：机器学习 | 计算机视觉 | (统计)机器学习

2022-06-20

观察到在训练期间重新定位神经网络，以改善最近的作品中的概括。然而，它既不在深度学习实践中被广泛采用，也不经常用于最先进的培训方案中。这就提出了一个问题，即何时重新定位起作用，以及是否应与正规化技术一起使用，例如数据增强，体重衰减和学习率计划。在这项工作中，我们对标准培训的经验比较进行了广泛的经验比较，并选择了一些重新定位方法来回答这个问题，并在各种图像分类基准上培训了15,000多个模型。我们首先确定在没有任何其他正则化的情况下，这种方法对概括始终有益。但是，当与其他经过精心调整的正则化技术一起部署时，重新定位方法几乎没有给予概括，尽管最佳的概括性能对学习率和体重衰减超参数的选择不太敏感。为了研究重新定位方法对嘈杂数据的影响，我们还考虑在标签噪声下学习。令人惊讶的是，在这种情况下，即使在存在其他经过精心调整的正则化技术的情况下，重新定位也会显着改善标准培训。

translated by 谷歌翻译

Current State and Future Directions for Learning in Biological Recurrent Neural Networks: A Perspective Piece

Luke Y. Prince , Roy Henha Eyono , Ellen Boven , Arna Ghosh , Joe Pemberton , Franz Scherr , Claudia Clopath , Rui Ponte Costa , Wolfgang Maass , Blake A. Richards

分类：人工智能

2021-05-12

我们简要介绍了从实验神经科学的研究结果对生物学学习的共同假设，并以经常性神经网络的梯度学习效率对比。本评论中讨论的关键问题包括：突触可塑性，神经电路，理论实验划分和客观功能。我们在设计新的研究时，我们的建议与理论和实验神经科学家的建议有助于为这些问题带来清晰度。

translated by 谷歌翻译

Overcoming catastrophic forgetting in neural networks

James Kirkpatrick , Razvan Pascanu , Neil Rabinowitz , Joel Veness , Guillaume Desjardins , Andrei A. Rusu , Kieran Milan , John Quan , Tiago Ramalho , Agnieszka Grabska-Barwinska

分类：

2016-12-02

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks which they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on the MNIST hand written digit dataset and by learning several Atari 2600 games sequentially.

translated by 谷歌翻译

Constructing Organism Networks from Collaborative Self-Replicators

Steffen Illium , Maximilian Zorn , Cristian Lenta , Michael Kölle , Claudia Linnhoff-Popien , Thomas Gabor

分类：神经与进化计算 | 机器学习

2022-12-20

We introduce organism networks, which function like a single neural network but are composed of several neural particle networks; while each particle network fulfils the role of a single weight application within the organism network, it is also trained to self-replicate its own weights. As organism networks feature vastly more parameters than simpler architectures, we perform our initial experiments on an arithmetic task as well as on simplified MNIST-dataset classification as a collective. We observe that individual particle networks tend to specialise in either of the tasks and that the ones fully specialised in the secondary task may be dropped from the network without hindering the computational accuracy of the primary task. This leads to the discovery of a novel pruning-strategy for sparse neural networks

translated by 谷歌翻译

Empirical Analysis of Limits for Memory Distance in Recurrent Neural Networks

Steffen Illium , Thore Schillman , Robert Müller , Thomas Gabor , Claudia Linnhoff-Popien

分类：机器学习 | 计算机视觉

2022-12-20

Common to all different kinds of recurrent neural networks (RNNs) is the intention to model relations between data points through time. When there is no immediate relationship between subsequent data points (like when the data points are generated at random, e.g.), we show that RNNs are still able to remember a few data points back into the sequence by memorizing them by heart using standard backpropagation. However, we also show that for classical RNNs, LSTM and GRU networks the distance of data points between recurrent calls that can be reproduced this way is highly limited (compared to even a loose connection between data points) and subject to various constraints imposed by the type and size of the RNN in question. This implies the existence of a hard limit (way below the information-theoretic one) for the distance between related data points within which RNNs are still able to recognize said relation.

translated by 谷歌翻译

VoronoiPatches: Evaluating A New Data Augmentation Method

Steffen Illium , Gretchen Griffin , Michael Kölle , Maximilian Zorn , Jonas Nüßlein , Claudia Linnhoff-Popien

分类：计算机视觉 | 机器学习

2022-12-20

Overfitting is a problem in Convolutional Neural Networks (CNN) that causes poor generalization of models on unseen data. To remediate this problem, many new and diverse data augmentation methods (DA) have been proposed to supplement or generate more training data, and thereby increase its quality. In this work, we propose a new data augmentation algorithm: VoronoiPatches (VP). We primarily utilize non-linear recombination of information within an image, fragmenting and occluding small information patches. Unlike other DA methods, VP uses small convex polygon-shaped patches in a random layout to transport information around within an image. Sudden transitions created between patches and the original image can, optionally, be smoothed. In our experiments, VP outperformed current DA methods regarding model variance and overfitting tendencies. We demonstrate data augmentation utilizing non-linear re-combination of information within images, and non-orthogonal shapes and structures improves CNN model robustness on unseen data.

translated by 谷歌翻译

AWT -- Clustering Meteorological Time Series Using an Aggregated Wavelet Tree

Christina Pacher , Irene Schicker , Rosmarie deWit , Katerina Hlavackova-Schindler , Claudia Plant

分类：机器学习

2022-12-13

Both clustering and outlier detection play an important role for meteorological measurements. We present the AWT algorithm, a clustering algorithm for time series data that also performs implicit outlier detection during the clustering. AWT integrates ideas of several well-known K-Means clustering algorithms. It chooses the number of clusters automatically based on a user-defined threshold parameter, and it can be used for heterogeneous meteorological input data as well as for data sets that exceed the available memory size. We apply AWT to crowd sourced 2-m temperature data with an hourly resolution from the city of Vienna to detect outliers and to investigate if the final clusters show general similarities and similarities with urban land-use characteristics. It is shown that both the outlier detection and the implicit mapping to land-use characteristic is possible with AWT which opens new possible fields of application, specifically in the rapidly evolving field of urban climate and urban weather.

translated by 谷歌翻译

Multi-view Graph Convolutional Networks with Differentiable Node Selection

Zhaoliang Chen , Lele Fu , Shunxin Xiao , Shiping Wang , Claudia Plant , Wenzhong Guo

分类：机器学习

2022-12-09

Multi-view data containing complementary and consensus information can facilitate representation learning by exploiting the intact integration of multi-view features. Because most objects in real world often have underlying connections, organizing multi-view data as heterogeneous graphs is beneficial to extracting latent information among different objects. Due to the powerful capability to gather information of neighborhood nodes, in this paper, we apply Graph Convolutional Network (GCN) to cope with heterogeneous-graph data originating from multi-view data, which is still under-explored in the field of GCN. In order to improve the quality of network topology and alleviate the interference of noises yielded by graph fusion, some methods undertake sorting operations before the graph convolution procedure. These GCN-based methods generally sort and select the most confident neighborhood nodes for each vertex, such as picking the top-k nodes according to pre-defined confidence values. Nonetheless, this is problematic due to the non-differentiable sorting operators and inflexible graph embedding learning, which may result in blocked gradient computations and undesired performance. To cope with these issues, we propose a joint framework dubbed Multi-view Graph Convolutional Network with Differentiable Node Selection (MGCN-DNS), which is constituted of an adaptive graph fusion layer, a graph learning module and a differentiable node selection schema. MGCN-DNS accepts multi-channel graph-structural data as inputs and aims to learn more robust graph fusion through a differentiable neural network. The effectiveness of the proposed method is verified by rigorous comparisons with considerable state-of-the-art approaches in terms of multi-view semi-supervised classification tasks.

translated by 谷歌翻译

MedalCare-XL: 16,900 healthy and pathological 12 lead ECGs obtained through electrophysiological simulations

Karli Gillette , Matthias A. F. Gsell , Claudia Nagel , Jule Bender , Bejamin Winkler , Steven E. Williams , Markus Bär , Tobias Schäffter , Olaf Dössel , Gernot Plank

分类：机器学习

2022-11-29

Mechanistic cardiac electrophysiology models allow for personalized simulations of the electrical activity in the heart and the ensuing electrocardiogram (ECG) on the body surface. As such, synthetic signals possess known ground truth labels of the underlying disease and can be employed for validation of machine learning ECG analysis tools in addition to clinical signals. Recently, synthetic ECGs were used to enrich sparse clinical data or even replace them completely during training leading to improved performance on real-world clinical test data. We thus generated a novel synthetic database comprising a total of 16,900 12 lead ECGs based on electrophysiological simulations equally distributed into healthy control and 7 pathology classes. The pathological case of myocardial infraction had 6 sub-classes. A comparison of extracted features between the virtual cohort and a publicly available clinical ECG database demonstrated that the synthetic signals represent clinical ECGs for healthy and pathological subpopulations with high fidelity. The ECG database is split into training, validation, and test folds for development and objective assessment of novel machine learning algorithms.

translated by 谷歌翻译

A New Graph Node Classification Benchmark: Learning Structure from Histology Cell Graphs

Claudia Vanea , Jonathan Campbell , Omri Dodi , Liis Salumäe , Karen Meir , Drorith Hochner-Celnikier , Hagit Hochner , Triin Laisk , Linda M. Ernst , Cecilia M. Lindgren

分类：机器学习 | 计算机视觉

2022-11-11

We introduce a new benchmark dataset, Placenta, for node classification in an underexplored domain: predicting microanatomical tissue structures from cell graphs in placenta histology whole slide images. This problem is uniquely challenging for graph learning for a few reasons. Cell graphs are large (>1 million nodes per image), node features are varied (64-dimensions of 11 types of cells), class labels are imbalanced (9 classes ranging from 0.21% of the data to 40.0%), and cellular communities cluster into heterogeneously distributed tissues of widely varying sizes (from 11 nodes to 44,671 nodes for a single structure). Here, we release a dataset consisting of two cell graphs from two placenta histology images totalling 2,395,747 nodes, 799,745 of which have ground truth labels. We present inductive benchmark results for 7 scalable models and show how the unique qualities of cell graphs can help drive the development of novel graph neural network architectures.

translated by 谷歌翻译